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7, (New) A method of mining a collection of data to identify related concepts 
which occur at a frequency greater than a configurable threshold, 
comprising: 



attempting to associate a category with each word within each of the 
clauses; 

identifying mutually dependent relationships between the categorized 
words, according to at least one rule defining mutually dependent relationships 
between categorized words; 

grouping the identified mutually dependent relationships into groups of 
related mutually dependent relationships; and 

identifying unique concepts associated with the identified groups, wherein 
the identifying of unique concepts comprises recognizing the identified groups in 
which the related mutually dependent relationships exceed a configurable 
threshold. 



8. (New) The method of claim 7 wherein the categorized words identified as being 
in a mutually dependent relationship exist within separate clauses. 

9. (New) The method of claim 8 wherein the separate clause exist within separate 
sentences. 



10. (New) The method of claim 7 wherein the converting comprises conversion of 
the received collection of data into the same format, wherein the received 
collection of data is made up of various data formats. 

1 1 . (New) The method of claim 7 wherein the words comprises a coherent character 
string within the clauses. 



receiving a collection of data; 



converting the received data into a conmion format; 



combining the converted received data into clauses of related data; 
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12. (New) The method of claim 7 wherein the associating of a category with each key 
word comprises searching a category dictionary to identify a category which 
matches the key word. 

13. (New) The method of claim 10 wherein the category dictionary is comprised of 
combinations of original expressions, parts of speech, concepts and categories, 
wherein the original expressions are equivalent to the key words, the parts of 
speech are a classification of the key words, the concepts are replacement 
expressions for the key words and the categories represent a larger group having 
the nature of the key word. 

14. (New) The method of claim 7 wherein the mutually dependent relationships are 
determined according to rules applied to keywords and their associated categories 
within a clause. 

15. (New) The method of claim 7 wherein identifying the groups in which the related 
mutually dependent relationships exceed a configurable threshold is achieved 
automatically through utilization of computer generated or user defined rules in 
which the configurable threshold is defined for each category. 

16. (New) The method of claim 7 wherein the configurable threshold is provided 
manually by a user for each of the identified groups of mutually dependent 
relationships in response to the user being presented a display of all the groups of 
mutually dependent relationship, at a GUI. 

17. (New) The method of claim 7 further comprises presenting the identified groups 
in which the configurable threshold was exceeded to a user at a GUI. 

18. (New) The method of claim 7 further comprises comparing the identified groups 
in which the configiu-able threshold was exceeded against a set of rules to 
determine a course of action, wherein the course of action comprises 
automatically notifying a user that the exceeding of the threshold, logging the 
exceeding of the threshold in a user accessible file, or ignoring the exceeding of 
the threshold. 

19. (New) The method of claim 7 wherein the set of rules are predefined. 

20. (New) The method of claim 7 wherein the set of rules are entered manually by a 
user during a request by the user for specific relationships between categories. 



whereby mutually dependent relationships by categories are 
defined by the user according to the user's needs. 
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21. (New) An article of manufacture, embodying logic to perform a method of 
mining a collection of data to identify related concepts which occur at a frequency 
greater than a configurable threshold, comprising: 

receiving a collection of data; 

converting the received data into a common format; 

combining the converted received data into clauses of related data; 

attempting to associate a category with each key word in each of the 
clauses; 

identifying mutually dependent relationships between the categorized 
words within each of the clauses; 

grouping the identified mutually dependent relationships into groups of 
relates mutually dependent relationships; and 

identifying the groups including mutually dependent relationships which 
occur at a frequency greater than a configurable threshold, 

whereby a search query is not required. 

22. (New) The article of manufacture of claim 21 wherein converting comprises 
conversion of the received collection of data into the same format, wherein the 
received collection of data is made up of various data formats. 

23. (New) The article of manufacture of claim 21 wherein the key word comprises a 
coherent character string within the clauses. 



24. (New) The article of manufacture of claim 21 wherein the associating of a 
category with each key word comprises searching a category dictionary to 
identify a category which matches the key word. 

25. (New) The article of manufacture of claim 24 wherein the category dictionary is 
comprised of combinations of original expressions, parts of speech, concepts and 
categories, wherein the original expressions are equivalent to the key words, the 
parts of speech are a classification of the key words, the concepts are replacement 
expressions for the key words and the categories represent a larger group having 
the nature of the key word. 

26. (New) The article of manufacture of claim 21 wherein the mutually dependent 
relationships are determined according to rules appUed to keywords and their 
associated categories within a clause. 
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27. (New) A system for data mining, comprising: 

means for receiving a collection of data; 

means for converting the received data into a common format; 

means for combining the converted received data into clauses of related 

data; 

means for attempting to associate a category with each key word in each 
of the clauses; 

means for identifying mutually dependent relationships between the 
categorized words within each of the clauses; 

means for grouping the identified mutually dependent relationships into 
groups of relates mutually dependent relationships; and 

means for identifying the groups including mutually dependent 
relationships which occur at a firequency greater than a configurable threshold, 

whereby a search query is not required. 

28. (New) A data mining service to accept a collection of data from a customer and to 
utilize a data mining system to identify related concepts which occur at a 
frequency greater than a configurable threshold, the data mining service 
comprising: 

means for receiving a collection of data from a customer; 

means for converting the received data into a common format; 

means for combining the converted received data into clauses of related 

data; 

means for attempting to associate a category with each key word in each 
of the clauses; 

means for identifying mutually dependent relationships between the 
categorized words within each of the clauses; 

means for grouping the identified mutually dependent relationships into 
groups of relates mutually dependent relationships; 

means for identifying the groups including mutually dependent 
relationships which occur at a fi"equency greater than a configurable threshold; 
and 

means for providing the customer with the identified groups including 
mutually dependent relationships which occur at a firequency greater than a 
configurable threshold. 
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